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TITLE; QUERY FILTER FOR QUERYING A MULTITUDE OF DATABASES 



FIELD OF THE INVENTION 



10001] The present invention is directed to a method, 
system and apparatus for executing SQL- like query 
selections on structured and semi-structured data files, 
independent o£ the file formats or file locations. In 
particular, the present invention provides a method, 
system and apparatus for expanding the embraced SQL 
SELECT grammar with user-supplied ordinary and group 
plug-ins functions. 



BACKGROUND OF THE INVENTION 
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[0002] With the amounts of available information doubling 
nearly every twenty years, seventy percent of 
information, according to the latest estimates, remains 
inaccessible or inactive, due to obsolescence or the fast 
pace of change. 

[0003] To circumvent this crisis, people in the 
information-processing field more than ever need 
coherent, adaptable data querying and reporting tools 
that provide analogous interface, regardless of the data 
source or data representation. 



[0004] Structured Query Language (SQL) has emerged as one 
of the most prevalent data query and reporting tool that 
navigates relational database-management systems (RDBMS) . 

[0005] Following SQL industry and academia acceptance, 
numerous versions Of coirmercially available SQL-alilce 
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dialects, tailored for specialized non-RDBMS needs, have 
appeared on the market. Just to name a few, these include 
SQL-based systems for viewing object-oriented or spatial 
information, for data mining, or for querying the web 
pages containing Extended Markup Language (XML) 
declarations . 

[0006] To suit its needs, an enterprise may store its 
data in variety of mediums: flat text files, web pages, 
relational, network, and/or object-oriented databases, 
data warehouses, or all of the above (just to name a 
few) . 

[0007] If the available data model does not fall into 
p the category for which an SQL-based dialect is 

^ commercially available, or, even worse, the enterprise 

Q 

jfl data model encompasses a variety of representations and 

formats, the company is forced to develop expensive 
1:3 customized applications to realize its basic querying or 

; reporting needs. To make things even more complicated, 

fy many of these applications are developed with the help of 

several programming languages r oftentimes simply because 

"s, 3 

□ one programming language lacks a simple feature, like the 

flow of control mechanism that is absent in basic SQL. 

[0008] It would be much simpler if one SQL-alike dialect 
could accommodate all kinds of data, be such data 
structured or semi -structured in its nature, especially 
data that comes from virtual databases scattered 
throughout the cyberspace, 

[0009] Many more elegant solutions can be achieved with 
expandable SQL-alike select grammar alone, if a user 
could only augment basic SQL grammar with new language 
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dictions performing special tasks that are absent or 
difficult to achieve with the means of the basic SQL 
alone . 

SUMMARY OF THE IISIVENTIOISI 

[00010] In attempt to disassociate SQL-alike interface 
from any potential data source or data format, and to 
provide a mechanism to expand the SQL select grammar with 
new dictions that suit unique business needs or otherwise 
difficult to achieve with basic SQL grammar ~ Qnery 
Filter was contrived as a universal and expandable tool 
for data querying and reporting, 

[00011] The foremost idea behind the Query Filter is to 
provide an SQL-based dialect that works with any data 
^-3 source and data format, providing that data source and 

data format are represented by an external computer 
program that supplies upon request one or more rows of 
data in a readable tabular format. 
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[00012] The second idea behind the Query Filter is to 
provide a flexible mechanism, called the plug- ins, to 
expand the SQL select grammar wii:h the new dictions. The 
plug- ins are user-written, preferably C programming 
language subroutines with the purpose o£ extending the 
capabilities of a basic SQL language, aimed at performing 
special tasks that are absent or difficult to achieve 
with the means of the basic SQL alone. 

[00013] It is not intended that the method, system and 
apparatus for providing the Query Filter be summarized 
here in its entirety. Rather, further features, aspects 
and advantages of the Query Filter are set forth in or 
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are apparent from the following drawings, detailed 
description and claims that follow. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[00014] Referring now to the drawings in which like 
reference numbers represent corresponding parts 
throughout : 

[00015] FIG- 1 presents a schematic of the Query Filter 
architecture - 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 



[00016] It will be understood that the foregoing brief 
description and the following detailed description are 
exemplary and explanatory of the method, system and 
apparatus for providing the Query Filter, but are not 
intended to be restrictive thereof or limiting of the 
advantages which can be achieved by the Query Filter, 
O Thus, the accompanying drawings referred to herein and 

constituting a part hereof, illustrate embodiments of the 
Query Filter and, together with the detailed description, 
serve to explain the principles of the Query Filter, 

[00017] The Query Filter architecture involves three 
interconnected components: (a) The Query Filter, (b) The 
Data Extractor Application, and (c) The Database. See 
FIG. 1 for a schematic of the Query Filter architecture. 

[00018] The Query Filter is an SQL-like dialect for 
issuing and parsing relational queries. 
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[00019] The Data Extractor Application is a computer 
program that extracts the data or obtains information 
about the supported Database schema, and returns the 
database schema or the given columns of data as one row 
of text- 

100020] The Database is a local or distributed 
collection of structured or semi-structured data files. 

100021] The interconnection of the components may be 
accomplished by many means depending on the operating 
system of the computers running the databases and Query 
Filter- For example, for UNIX and UNIX-lilce operating 
systems such as LIjSIUX. as shown in the attached Figures, 
the interconnection may be achieved by a UNIX pipe. For 
other operating systems such as WINDOWS/ similar means 
for interconnection are available • 

[00022] The Query Filter working sessions that executes 
a typical SELECT statement will now be described. First, 
the user issues a SELECT Ofuery to the Query Filter. The 
Query Filter examines its operating system environment, 
parses the supplied SQL select dictions, and issues a 
call via the interconnection such as a UNIX read pipe to 
the external program, called the Data Extractor 
Application, to list its Database schema • 

[000233 The Data Extractor Application parses its input 
arguments, obtains information about the supported 
Database schema, and writes the requested information to 
the standard output. 

[00024] The Query Filter reads the Data Extractor 
Application reply via the UNIX read pipe interconnection 
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and validates the requested columns compliance with the 
supported Database schema. 

[00025] If the request is unsupported by the Database 
schema, the Query Filter terminates with a suitable error 
message . 

[00026] If the request is supported by the Database 
schema, the Query Filter issues a call (via the UNIX read 
pipe) to the Data Extractor Application to return the 
given columns of data as one row of text- 

[00027] The Data Extractor Application parses its input 
arguments/ obtains the requested col-umns of data from its 
Database, and returns the rows of extracted data as text 

l~ 

U by writing it to the standard output. 

in 

\jl [00028] The Query Filter reads the rows of data from the 

[f™ UNIX read pipe. The filtering, specified in the WHERE 

language clause is applied against the intended data 
coliimns- The rows of data that pass the filtering 

O selection only are written to the standard output, for 

''""4 

the user interpretation. 

[00029] The Query Filter is a computer program that 
generates the requested SQL SELECT reports over the 
columns of data supplied by the Data Extractor 
Application. 

[00030] The Query Filter communicates with the Data 
Extractor Application via a UNIX pipe mechanism by 
reading standard input from the UNIX pipe, or writing 
standard output to the UNIX pipe. 
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[00031] The Query Filter learns about the location of 

the Data Extractor Application via designated variable 

that is set within the user UNIX environment as follows. 

[00032] The Query Filter "<select clause>" grammar 
follows, with a few exceptions, a basic syntax of SELECT 
statement present in all popular SQL dialects. 

[00033] The supported Query Filter grammar in Backus 
Naur Form is as follows: 

[00034] SELECT [DISTINCT] column function | expr {, 
column I function | exr} FROM filename [file^alias] OF 
TYPE filetype [, filename file_alias OF TYPE filetype] [ 

M' WHERE search_condition] [ORDER BY (ASCENDING 1 DESCENDING] 

Pi column I function {/ column 1 

Q function }] [ GROUP BY column {, column} [HAVING 

f n search_cond] ] 

s 

^ [00035] Where: 

^1 [00036] column - name of the data column, supported by 

the data extractor application. To avoid ambiguities, 
name of the column may not be since data extractor 

application is allowed to support heterogeneous records 
present in one source data file* 

[00037] function - a built-in aggregate, mathematical, 
string, or time-handling function that operates on 
column's or stipulated values: 
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Table 1 



Function 


Dscription 


SUM(c) 


Computes the sum of column c 
numerical entries. 


AVG(c} 


Computes the average of column 
c momerical entries . 


COUNT (c) 
MIN(C) 


Computes the number of column 
c entries. 

Computes the minimum value of 
column c textual entries. 


NMIN(C) 


Computes the minimum value of 
column c numerical entries. 


MAX(c) 


Computes the maximum value of 
column c textual entries. 


NMAX(C> 


Computes the maximum value of 
column c numerical entries. 


SQRT(x) 


Computes the square root of x. 


LOG10{x) 


Computes the base- 10 logarithm 
of X. 


LOG(X) 


Computes the natural logarithm 
of X, 


EXP (x) 


Computes the exponential 
function e**x. 


POW(x, y) 


Computes x raised to the power 
of y, x**y. 


ABS(x) 


Returns the absolute value of 
its int operand x. 


SIN{x) 


Return trigonometric function, 
sin ( ) , of radian argument x . 


COS (X) 


Return trigonometric function, 
cosO. of radian argument x. 


TAN(x, y) 


Return trigonometric function, 
tan ( ) , of radian arguments x 
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ASIN(x) 


Returns the arc sine of x in 
fhe T"ancre -n/2 to n/2 • 


ACQS (X) 


Returns the arc cosine of x in 


ATAN(X) 


Returns the arc tangent of x 
in the range -n/2 to n/2- 


ATAN2(y/ X) 


Convert rectangular 
coordinates (x,y) to polar 
(r,0); atan2(y,x) computes 0, 

computing an arc tangent of 
y/x in the range -n to n. 


CEIL{x) 


Returns the least integral 
Tr^tlnft rfY'f^A^ than or ©cnial to 

X, 


FLOOR (X) 


Returns the greatest integral 
value less than or equal to x. 


RINT (X) 


Kounos X EO an muegirax vaxue 
according to the current 
IEEE754 rounding direction. 


STRLEN ( s ) 


textual string 's' or coliinin's 
s textual value. 


REPLACE ( c , * target^s tring, 

' replaceinent_striiicf ' } 


Replaces all occurrences of 

uiie carg e u^s L.I: xng wiun txie 
' replaceinent_string' in the 
coluion c. 


SUBSTR ( c . Star t_index, 
s UJD s E 1 en J 


Returns a sub -string from the 

V^V/XWlUil v.* tDUClJLUXxxy dU 

start_index (1„.) of the 
maximum length of substr_len. 
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□ 



STRCAT(stringl, string2) 


Catenate string2 to stringl. 


CTIME( [CC]yYMMDD, HHMMSS ) 


Returns time in seconds since 

00:00:00 UTC, January 1, 1970. 

rr-J Trfii-n Aa^^ F 1 YYMMDD and. tiltlS 

HHMMSS colTxmns or values, like 

CTIME( ■ 19990112 ' , '232215'). 


PLUGIN2 ( £unc_name / arg2 ) 


plug- ins function of 2 
arguments . 


PLUG IN3 ( f unc_xiame , ar g2 , 
arg3) 


U SSr ""QSX jLiiSCl ix^lX ciy y JL csy auc: 

plug-ins function of 3 
arguments - 


PLUGIN4 ( f unc^name , arg2 , 
arg3, arg4) 


US©r "^a©iiiie<-i nwii ciyyicsyouc 

plug-ins function of 4 
arguments . 


PLUGIN5 ( f unc_name , ar g2 , 
arg3, arg4, argS) 


user-aennea non-agyieyat.e 
plug-ins function of 5 
arguments . 


PLUGIN6 ( f unc_narne , arg2 , 
arg3, arg4, argS, arg6) 


USer~Qexmeci nuii o,yyjLc;ycivc; 
plug-ins function of 6 

ctx y uULiexi L> o - 


GRPLUGIN2 ( f \mc_name , arg2 ) 


User-defined aggregate plug- 


GRPLUGIN3 ( f unc_narQ.e , ar g2 , 
arg3) 


User-defined aggregate plug- 
ms iuncuion or ^ aryuiueiius . 


GRPLUGIN4 ( f -unc.name , ar g2 , 
arg3, arg4) 


User-defined aggregate plug- 
ins Luncuxon oe 4t aryuiuenus - 


GRPLUGIN5 { f unc^naine , arg2 , 
arg3 , arg4, argb; 


User-defined aggregate plug- 

ms Luncuion ox d airy uiueiius - 


fjR pr.TTfiTlvIfi ^ f "iiTit^ natri^ aT"or2 

\JV\.S \ i iAA*^»^_AiW4*l.^ * Wiy^ f 

arg3, arg4, argS, arg6) 


GRPLiUGZNS ( f unc name . arQ2 , 
arg3, arg4, argS, arg6) 


GRPLUGIN9 ( f unc_naine , arg2 , 


User-defined aggregate plug- 
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arg3, 


arg4, 


arg5 , arg6 , 


ins function of 9 arguments. 


arg7. 


argS/ 


arg9) 





[00038] Search_condition - contains separate comparisons 
for strings and numbers. By default all values are 
textual strings. Using a numerical, borrowed from the 
FORTRAN language; comparison or a function enforces data 
conversion to the appropriate numeric type. 

[00039] HAVING search_condition - Follows the GROUP BY 
clause and can contain the same kind of search condition 
you may specify in a WHERE clause. 



Table 2 





Comparison 


Description 






Textual ecfual , 






Textual equal. 






Not equal, 


ffl 


>= 


Textual greater or equal. 




NGE 


Numerical greater or equal . 




<= 


Textual less or equal. 


H 


NLE 


Numerical less or equal. 


V '"I 


> 


Textual greater than. 




NGT 


Numerical greater than. 




< 


Textual less than. 




NLT 


Nximerical less than. 




LIKE 'string%^ 


1 LIKE % 


Returns true if matching 
string is found, false 
otherwise. 




AND 


AND clause. 




OR 


OR clause* 




XOR 


Exclusive OR clause. 




NOT 


Not clause. 
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[00040] expressions - may contain functions and math 
statements involving operators applied on integer and 
real numbers: 



Table 3 



Operator 


Description 


+ 


Addition. 




Subtraction- 




Division. 


* 


Multiplication. 


& 


Binary AND, 


1 

1 


Binarv OR 


% 


Mudule operator, 
a % b. Divides 
two numbers and 
returns only the 
remainder . 



[00041] Query Filter Environment requires setting of the 
following variables: 

[00042] A) Mandatory environment variable, 
QUERy_PILTER_DATA_EXTRACTOR, must point to the location 
of the data extractor application. For example, a csh 
user can set Query Filter environment by typing the 
following command (beware, that in each case the name of 
the individual data extractor application will differ) : 

[00043] setenv QUERY_FILTERJDATA_EXTRACTOR 
-/bin/my_data_extractor 
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[00044] The data extractor application is a user-written 
stand-alone application that knows only about the 
Database (source data file(s)) schema and how to read the 
colurans of data from the stipulated source file of the 
given type. The user of the Query Filter is responsible 
for developing or acciuiring a data extractor application 
suitable for his or her data organization. The data 
extractor application interface protocol is 
straightforward and presented in the sections below. 

[00045] B) Optional environment variable, 
QUERY_FILTER_DATA_EXTR^CTOR_CACHE, must point CO the 
directory to be used for intermediate storage of results. 
Usage of this environment variable is not mandatory but 
strongly recommended for faster processing when files 



[00046] C) Optional environment variable, 
QUERY_FILTER_OUTPTJT_FILE, must point to a valid file 
location path. This variable may be used for sending 
output to the file directly, in addition to stdout. 
[00047] The Data Extractor Application is an external 
computer program supplied by the user with the purpose to 
provide information to the Query Filter about the 
Database schema, and to return the requested columns of 
data as a rows of text. 



[00048] The Data Extractor Application may be written in 
any practical programming language of choice and must 
adhere to two types of the interface protocols . 

[00049] The Data Extractor Application must adhere to 
two types of interface protocols: 




aliases are used. 
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[00050] The first type of interface protocol deals with 
acquiring tabulated data from the database. 

[00051] The following synopsis irtust be observed: 



TyPE> filedl:field2: . . . [fieldl: , . . f ieldN] " 
[000531 Where: 

[00054] <m PILE NAME> - Full path of your data file; 
<MY FILE TYPE> - Name of my file data type; 

[00055] fieldl:field2: ... [fieldl: .fieldN] -Column, 
separated list of asked columns, 

[00056] with the same column may be asked more than 
once; 

[00057] - Fields separator. 

[00058] Tabulated output (no blanks within columns) is 
directed to stdout/ with comma acting as a fields 
separator . 

[00059] The second type of the interface protocol deals 
with discovering a database schema. 

[00060] The following synopsis must be observed: 

r00061]my_data_extractor <MY FILE NAME> <My FILE 
TYPE> -help 



[00052] 



my_da t a_ex t r a c t or 



<MY FILE NAME> 



<MY FILE 
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[00062] Record <MY RECORD NAME ONE> 
(fieldl:field2: fieldN) 

Record <MY RECORD NAME TWO (f ieldl: f ield2 : fieldN) 



Record <m RECORD NAME N> (f ieldl : field2 : fieldN} 

[00063] Here, braces that encompass column- separated 
fields names must follow each record name. 

[00064] Tlie following examople is a case in point of a 
simple Database and its Data Extractor Application, 

[00065] The Database (a flat file) Company_Widget_Price 
contains Company, Widget and Price columns as seen below: 

[00066] Company C, Widget_C 300.0 

Company A . Wi dge t_A 100-0 

Company B. Widget_B 200.0 

The Company Widget_T 500.0 

[00067] The Data Extractor Application that reads this 
Database or provides information on its schema may be 
realized in ksh as follows: 
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Table 4 

# ! /bin/ksh 
## 

# Extract designated fields from the file's records. 
# 

# Synopsys: 
# 

# data_extractor.ksh <file_naine> <file_type> 
f ieldl : f ield2 : . . . f ieldN " , " 

# 

# data_extractor . ksh <file_name> <file_type> -help 
# 

## 

export file_naine="$l" 
export file_type="$2'' 
export fields="$3" 
e^qport separ="$4" 

if [ "$separ'' == ] 
then 

escort separ=" , " 
fi 

## 

# Output a usage message. 
## 

fimction usage_jnaessage 
{ 

echo "NnUsage is:\n" 

echo "data_extractor.ksh <file_name> <file_type> 
f ieldl : f ield2 : . . . f ieldN \ " , \ " \n " 

echo "data_extractor .ksh <file_name> <file_type>- -help\n" 
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} 

if [ "$file_type" == "Company_Widget_Price" I 

then 
## 

# Provide a help only. 
## 

if [ "$fields" == "-help" ] 
then 

echo "Record (Company : Widget : Price) " 

exit 0 

fi 

if [ "$fields" == "» ] II [ "$separ" == ] 
then 

usage_inessage 

exit 1 

fi 

## 

# Insert a blank instead of a fields separator. 

## 

export Fields='echo $fields | tr ' : • ' 
## 

# Read a record and output selected fields only, separated 
by delimiter- 

## 

while read line 
do 

if [ "$line" == " »' ] 
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then 

continue 
fi 

unset Arrival 

set +A Arrival $line 

## 

# Concatenate with, _ .if field, Company, consists from two 
words • 

## 

if [ "${#Arrival[*] }" == "4" ] 
then 

export conipany=$ {Arrival [0] }"_"${ Arrival [1] } 
export widget=$ {Arrival [2] } 
export price=$ {Arrival [3] } 
else 

export coitipany=$ {Arrival [ 0 ] } 
export widgeti=$ {Arrival [1] } 
export priGe=$ {Arrival [2] } 
fi 

## 

# Select regusted fields only + separator. 

## 

export oline="" 
word_separ=" " 

for field_jaame in $Fields 
do 

if [ "$field_name" == "Company" ] 
then 
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expor t o 1 ine= $ o 1 ine $ wor d_s epar $ company 
fi 

if [ "$field_name" == "Widget" ] 
then 

export oline=$oline$word_separ$widget 

fi 

if [ "$field_name" == "Price" ] 
then 

export oline=$oline$word_separ$price 
fi 

export wor d_s epar =Ssepar 
done 

## 

# Output the requested fields separated by delimiter. 
## 

if [ "$oline" 1= ] 
then 

echo $oline 
fi 

done <$file_name 
else 

usage_message 

exit 1 

fi 

exit 0 
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[00068] With Data Extractor Application in place, a 
typical user session may then proceed as follows: 

[00069] setenv 

QUERY_FILTER_DATA„EXTRACTOR= . /data^extractor . ]csh 
-/query_£ilter "SELECT REPLACE (Company, ' Price 

FROM . /Company_Widget_Price OF TYPE Company_Widget_Price 
WHERE Price GE 300 -0" 



[00070] Company C. 300.0 
The company 500 . 0 

[00071] Please, note, the use of the function REPLACE(). 
This function accepts a tabulated field value that may 
contain several words joined by a hyphen inside the Data 
Extractor Application. For output purposes hyphens are 
replaced with the blanlcs. 



[00072] The Database may comprise a set of data files, 
relational databases, HTML pages, XML pages, any other 
known data source or combination of all of the above 
located locally or dispersed through a cyberspace . 

[00073] The Database may be well structured or irregular 
and incomplete, called semi -structured by the research 
community? Data lacking well defined constrained 
structure, or data whose structure may change rapidly and 
unpredictably (like a WEB page 
with XML declarations) . 
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[00074] The Query Filter in addition to the embraced SQL 
SELECT grammar supports the user-supplied plug-ins 
functions . 

[0007 5] The notion of the plug-ins functions is to 
offset limitations of the basic SQL SELECT dialect 
grammar by giving user a choice to write new dictions 
that are chiefly required by the business function, or 
that are otherwise difficult to achieve with the basic 
SQL grammar. 

[00076] A case in point of employing the user supplied 
plug- ins function is to support selection of every 2nd 
row of data. The basic SQL SELECT grammar does not 
support the notion of the periodic counters. However, by 
Q implementing the plug-ins function EVERY_NTH, selection 

ffj of every 2nd row reduces to a common SELECT statement 

with the PLUGN3 {EVERy_NTH, 2, Widget) eq 1 statement in 
rfi its where clause, 

fil [00077] This example shows usage of the plug-ins 

C3 function, EVERY_NTH, to select every second row 

fi containing a widget. 

[00078] The Database (a flat file) Company_Widget_Price 
contains Company/ Widget and Price columns as seen below: 

[00079] Company C. Widget_C 300.0 
Company A. Widget^A 100.0 
Company B, Widget_B 200.0 

The Company Widget_T 500.0 

[00080] The following Query Filter statements select 
every second row from the Database. 
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[00081] S e t env QUERY^F ILTER_DATA_EXTRACTOR 
. /data_extractor . ksh 

./(juery^filter "SELECT REPLACE (Company, ' 
Widget, Price FROM . /Coiapany_Widget_Price OF TYPE 
Company__Widget_Price WHERE 
PLUGN3 (EVERY_NTH, 2, Widget) eq 1 

[00082] Company A. Widget^ 300.0 
The company Widget_T 500.0 

[00083] The plug- ins functions are the mechanism to 
expand the SQL finite grammar. To provide the binding. 
The Query Filter program is supplied with its source code 
and the makefile, a script that invokes compilation and 
linkage of the programming language into executable 
machine instructions. This makes writing plug-ins very 
easy, while not restricting what they can do. 

[00084] The Query Filter C-language source code contains 
non- obfuscated module, called the plugin.c. This module 
contains a modifiable table of supported plug-ins 
function names and associated C-language subroutines that 
implement and invoke these plug-ins functions . 

[00085] To implement a new plug- ins function, the user 
maast observe the following three steps inside the 
plugin.c module: (a) update the table of the supported 
plug-ins functions with the new plug-ins function name; 
(b) implement the associated subroutine that executes the 
plug-ins function logic; (c) code invocation of the plug- 
ins subroutine from within the appropriate plug- ins 
invocation subroutine. 
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[00086] The user-supplied plug-ins functions may be of 
two kinds: ordinary, like SINCcoluinn name) or COS (column 
name)) ; or group, that is operating on a set of values, 
like MINS (column name) or SUM (column name) . 

[00087] The plug-ins invocation subroutines are 
predefined subroutines inside the plugin.c module with 
names starting from the PLUGINl ( ) , PLUGIN2 ( ) , 
,..PLUGIN<N> 0 , or GRPLUGINl ( ) , GRPLUGIN2 ( ) , GRPLUGIN<N> ( ) . 

[00088] The number at the end of the plug- ins invocation 
subroutine name, like 'PLUGIN3 () , indicates the maximum 
number of arguments supplied to this subroutine. 

N' Accordingly, invocation of the user-siapplied plug-ins 

subroutine with N arguments must occur from within the 

Q plug- ins invocation subroutine that supports the same 

Tf^ number of arguments. 

l:Q 

[00089] The plug-ins invocation subroutines with names 
i"^ starting from the PLUGINl () , PLUGIN2(), «.PLUGIN<N>() are 

; Li 

PI used for invocation of the ordinary user-sijipplied plug- 

^'M ins functions that compute the results right away. 

: s 

[00090] The plug-ins invocation subroutines with names 
starting from the GRPLUGINl (), GRPLUGIN<N> ( ) are used 
for invocation of the group user-supplied plug-ins 
functions that compute results over a set of values. At 
the end of the data set, the Query Filter automatically 
invokes the group plug- ins functions with all arguments 
set to NULL; and this is when the final results should be 
computed and output. 
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[00091] This exaicple shows implementation and usage of 
the group plug- ins function, MY^AVERAGE, to compute the 
average price of the widget. 

[00092] First step is to implement the new plug-ins 
function MY_AVERAGE and add it to the plugin.c module. 
The following code snippet shows all places affected by 
the introduction of the new plug-ins function inside the 
plugin,c module, 

Table 5 

Char*" plugs_ins[] = { 



MY_AVERAGE", 



T; 



static char* MyAverage (char* number); 



char* grplugin2 (char* name, char* arg2 /*Plugs-in 
invocation subroutine */ 

1 



if (strcmp(name, ^^MY_AVERAGE" ) == 0) 
{ ~ 
return (MyAverage (arg2 ) ) ; 

> 

else if (strcmp(name , /* Previous plug- 
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ins invocation */ 

_ 



char* MyAverage(char* number) 
_ 

static long count = 0; 

static long smtv =0; 

static char^' result [BUFSIZ] ; 



result[0] ='\0'; /* Output result should 

contain empty string, unless it is the last call */ 



if (nurnber) 
_ 

count++; 

sum += atoi (number) ; 

_ 

else 
_ 

sprintf (result, ''%ld'S sum / count); 

_ 



return (result) ; 

} 



[00093] The next step is to recompile and to link the 
Query Filter, by issuing the make command . 

100094] Once make completes successfully the Query 
Filter is ready for use. 
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[00095] The Database (a flat file) Company_Widget_Price 
contains Company, Widget and Price columns as seen below: 



Company C. Widget^C 300.0 

Company A. Widget^A 100.0 

Company B. Widget_B 200.0 

The Company Widget^T 500.0 



[00096] The following Query Filter statements compute 
the average price of the widget from the company 
Database. 



setenv QUERY_FILTER^DATAJEXTRACTOR 
. /data_extractor . ]csh 



./query_filter "SELECT grplugin2 {MY_AVERAGE, Price) FROM 
in . /Company_Widget_Price OP TYPE Company JWidget_Price « 

[00097] Invocation of the new user-supplied plug- ins 
P^j subroutine from inside the plug- ins invocation subroutine 

U should not bloclc preceding user-supplied plug- ins 

^ subroutines from invocation. For that purpose, the first 

M argument supplied to the plug-ins is reserved for the 

plug-ins name, liJce EVERY31TH or MY_AVERAGE in our 
examples- The name of the plug- ins is used to 
discriminate which plug- ins subroutine to invoJce. 



[00098] The data types of all the arguments passed to the 
plug-ins or returned from the plug-ins subroutines should 
be pointers to characters. 

[00099] To ma]<e the Query Filter available for use, the 
plugin.c module must be complied and linlced with the rest 
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of the Query Filter source code, using the UNIX make 
utility. 

[000100] Although illustrative eihbodiments have been 
descried herein in details, it should be noted and 
understood that the descriptions have been provided for 
purposes of illustration only and that other variations 
both in form and detail can be made thereupon without 
departing from the spirit and scope of the method, system 
and apparatus for providing a Query Filter. The terms and 
e>^ressions have been used as terms of description and 
not terms of limitation. There is no limitation to use 
the terms or expressions to exclude any ecjuivalents of 
features shown and described or portions thereof, and the 
Query Filter should be defined with the claims that 
follow. 



DEC 07 '01 15:47 



- 27 - 



416 368 1645 PRGE.32 



